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CCGGGTCGACCCACGCGTCCGAAGGCCCCCTCTCACTCCGCTCCACTCCTCGGGCTGG 
CTCTCCTGAGGATGCACCAGCGTCACCCCCGGGCAAGATGCCCTCCCCTCTGTGTGGC 
CGGAATCCTTGCCTGTGGCTTTCTCCTGGGCTGCTGGGGACCCTCCCATTTCCAGCAGA 
GTTGTCTTCAGGCTTTGGAGCCACAGGCCGTGTCTTCTTACTTGAGCCCTGGTGCTCCC 
TTAAAAGGCCGCCCTCCTTCCCCTGGCTTCCAGAGGCAGAGGCAGAGGCAGAGGCGGG 
CTGCAGGCGGCATCCTACACCTGGAGCTGCTGGTGGCCGTGGGCCCCGATGTCTTCCA 
GGCTCACCAGGAGGACACAGAGCGCTATGTGCTCACCAACCTCAACATCGGGGCAGAA 
CTGCTTCGGGACCCGTCCCTGGGGGCTCAGTTTCGGGTGCACCTGGTGAAGATGGTCA 
TTCTGACAGAGCCTGAGGGTGCTCCAAATATCACAGCCAACCTCACCTCGTCCCTGCTG 
AGCGTCTGTGGGTGGAGCCAGACCATCAACCCTGAGGACGACACGGATCCTGGCCATG 
CTGACCTGGTCCTCTATATCACTAGGTTTGACCTGGAGTTGCCTGATGGTAACCGGCAG 
GTGCGGGGCGTCACCCAGCTGGGCGGTGCCTGCTCCCCAACCTGGAGCTGCCTCATTA 
CCGAGGACACTGGCTTCGACCTGGGAGTCACCATTGCCCATGAGATTGGGCACAGCTT 
CGGCCTGGAGCACGACGGCGCGCCCGGCAGCGGCTGCGGCCCCAGCGGACACGTGA 
TGGCTTCGGACGGCGCCGCGCCCCGCGCCGGCCTCGCCTGGTCCCCCTGCAGCCGCC 
GGCAGCTGCTGAGCCTGCTCAGCGCAGGACGGGCGCGCTGCGTGTGGGACCCGCCGC 
GGCCTCAACCCGGGTCCGCGGGGCACCCGCCGGATGCGCAGCCTGGCCTCTACTACA 
GCGCCAACGAGCAGTGCCGCGTGGCCTTCGGCCCCAAGGCTGTCGCCTGCACCTTCGC 
CAGGGAGCACCTGGATATGTGCCAGGCCCTCTCCTGCCACACAGACCCGCTGGACCAA 
AGCAGCTGCAGCCGCCTCCTCGTTCCTCTCCTGGATGGGACAGAATGTGGCGTGGAGA 
AGTGGTGCTCCAAGGGTCGCTGCCGCTCCCTGGTGGAGCTGACCCCCATAGCAGCAGT 
GCATGGGCGCTGGTCTAGCTGGGGTCCCCGAAGTCCTTGCTCCCGCTCCTGCGGAGGA 
GGTGTGGTCACCAGGAGGCGGCAGTGCAACAACCCCAGACCTGCCTTTGGGGGGCGT 
GCATGTGTTGGTGCTGACCTCCAGGCCGAGATGTGCAACACTCAGGCCTGCGAGAAGA 
CCCAGCTGGAGTTCATGTCGCAACAGTGCGCCAGGACCGACGGCCAGCCGCTGCGCTC 
CTCCCCTGGCGGCGCCTCCTTCTACCACTGGGGTGCTGCTGTACCACACAGCCAAGGG 
GATGCTCTGTGCAGACACATGTGCCGGGCCATTGGCGAGAGCTTCATCATGAAGCGTG 
GAGACAGCTTCCTCGATGGGACCCGGTGTATGCCAAGTGGCCCCCGGGAGGACGGGA 
CCCTGAGCCTGTGTGTGTCGGGCAGCTGCAGGACATTTGGCTGTGATGGTAGGATGGA 
CTCCCAGCAGGTATGGGACAGGTGCCAGGTGTGTGGTGGGGACAACAGCACGTGCAGC 
CCACGGAAGGGCTCTTTCACAGCTGGCAGAGCGAGAGAATATGTCACGTTTCTGACAGT 
TACCCCCAACCTGACCAGTGTCTACATTGCCAACCACAGGCCTCTCTTCACACACTTGG 
CGGTGAGGATCGGAGGGCGCTATGTCGTGGCTGGGAAGATGAGCATCTCCCCTAACAC 
CACCTACCCCTCCCTCCTGGAGGATGGTCGTGTCGAGTACAGAGTGGCCCTCACCGAG . 




GACCGGCTGCCCCGCCTGGAGGAGATCCGCATCTGGGGACCCCTCCAGGAAGATGCT 

GACATCCAGGTTTACAGGCGGTATGGCGAGGAGTATGGCAACCTCACCCGCCCAGACA 

TCACCTTCACCTACTTCCAGCCTAAGCCACGGCAGGCCTGGGTGTGGGCCGCTGTGCG 

TGGGCCCTGCTCGGTGAGCTGTGGGGCAGGGCTGCGCTGGGTAAACTACAGCTGCCTG 

GACCAGGCCAGGAAGGAGTTGGTGGAGACTGTCCAGTGCCAAGGGAGCCAGCAGCCA 

CCAGCGTGGCCAGAGGCCTGCGTGCTCGAACCCTGCCCTCCCTACTGGGCGGTGGGA 

GACTTCGGCCCATGCAGCGCCTCCTGTGGGGGTGGCCTGCGGGAGCGGCCAGTGCGC 

TGCGTGGAGGCCCAGGGCAGCCTCCTGAAGACATTGCCCCCAGCCCGGTGCAGAGCA 

GGGGCCCAGCAGCCAGCTGTGGCGCTGGAAACCTGCAACCCCCAGCCCTGCCCTGCC 

AGGTGGGAGGTGTCAGAGCCCAGCTCATGCACATCAGCTGGTGGAGCAGGCCTGGCCT 

TGGAGAACGAGACCTGTGTGCCAGGGGCAGATGGCCTGGAGGCTCCAGTGACTGAGG 

GGCCTGGCTCCGTAGATGAGAAGCTGCCTGCCCCTGAGCCCTGTGTCGGGATGTCATG 

TCCTCCAGGCTGGGGCCATCTGGATGCCACCTCTGCAGGGGAGAAGGCTCCCTCCCCA 

TGGGGCAGCATCAGGACGGGGGCTCAAGCTGCACACGTGTGGACCCCTGCGGCAGGG 

TCGTGCTCCGTCTCCTGCGGGCGAGGTCTGATGGAGCTGCGTTTCCTGTGCATGGACT 

CTGCCCTCAGGGTGCCTGTCCAGGAAGAGCTGTGTGGCCTGGCAAGCAAGCCTGGGAG 

CCGGCGGGAGGTCTGCCAGGCTGTCCCGTGCCCTGCTCGGTGGCAGTACAAGCTGGC 

GGCCTGCAGCGTGAGCTGTGGGAGAGGGGTCGTGCGGAGGATCCTGTATTGTGCCCG 

GGCCCATGGGGAGGACGATGGTGAGGAGATCCTGTTGGACACCCAGTGCCAGGGGGT 

GCCTCGCCCGGAACCCCAGGAGGCCTGCAGCCTGGAGCCCTGCCCACCTAGGTGGAA 

AGTCATGTCCCTTGGCCCATGTTCGGCCAGCTGTGGCCTTGGCACTGCTAGACGCTCG 

GTGGCCTGTGTGCAGCTCGACCAAGGCCAGGACGTGGAGGTGGACGAGGCGGCCTGT 

GCGGCGCTGGTGCGGCCCGAGGCCAGTGTCCCCTGTCTCATTGCCGACTGCACCTACC 

GCTGGCATGTTGGCACCTGGATGGAGTGCTCTGTTTCCTGTGGGGATGGCATCCAGCG 

CCGGCGTGACACCTGCCTCGGACCCCAGGCCCAGGCGCCTGTGCCAGCTGATTTCTGC 

CAGCACTTGCCCAAGCCGGTGACTGTGCGTGGCTGCTGGGCTGGGCCCTGTGTGGGAC 

AGGGTACGCCCAGCCTGGTGCCCCACGAAGAAGCCGCTGCTCCAGGACGGACCACAG 

CCACCCCTGCTGGTGCCTGTGGCAGGCAGCACCTTGAGCCAACAGGAACCATTGACAT 

GCGAGGCCCAGGGCAGGCAGACTGTGCAGTGGCCATTGGGCGGCCCCTCGGGGAGGT 

GGTGACCCTCCGCGTCCTTGAGAGTTCTCTCAACTGCAGTGCGGGGGACATGTTGCTG 

CTTTGGGGCCGGCTCACCTGGAGGAAGATGTGCAGGAAGCTGTTGGACATGACTTTCA 

GCTCCAAGACCAACACGCTGGTGGTGAGGCAGCGCTGCGGGCGGCCAGGAGGTGGGG 

TGCTGCTGCGGTATGGGAGCCAGCTTGCTCCTGAAACCTTCTACAGAGAATGTGACATG 

CAGCTCTTTGGGCCCTGGGGTGAAATCGTGAGCCCCTCGCTGAGTCCAGCCACGAGTA 

ATGCAGGGGGCTGCCGGCTCTTCATTAATGTGGCTCCGCACGCACGGATTGCCATCCAT 

GCCCTGGCCACCAACATGGGCGCTGGGACCGAGGGAGCCAATGCCAGCTACATCTTGA 

TCCGGGACACCCACAGCTTGAGGACCACAGCGTTCCATGGGCAGCAGGTGCTCTACTG 



GGAGTCAGAGAGCAGCCAGGCTGAGATGGAGTTCAGCGAGGGCTTCCTGAAGGCTCAG 
GCCAGCCTGCGGGGCCAGTACTGGACCCTCCAATCATGGGTACCGGAGATGCAGGACC 
CTCAGTCCTGGAAGGGAAAGGAAGGAACC 
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PGRPTRPKAPSHSAPLLGLALLRMHQRHPRARCPPLCVAGILACGFLLGCWGPSHFQQSCL 

GALEPQAVSSYLSPGAPLKGRPPSPGFQRQRQRQRRAAGGILHLELLVAVGPDVFQAHQED 

TERYVLTNLNIGAELLRDPSLGAQFRVHLVKMVILTEPEGAPNITANLTSSLLSVCGWSQTINP 

EDDTDPGHADLVLYITRFDLELPDGNRQVRGVTQLGGACSPTWSCLITEDTGFDLGVTIAHEI 

GHSFGLEHDGAPGSGCGPSGHVMASDGAAPRAGLAWSPCSRRQLLSLLSAGRARCVWDP 

PRPQPGSAGHPPDAQPGLYYSANEQCRVAFGPKAVACTFAREHLDMCQALSCHTDPLDQS 

SCSRLLVPLLDGTECGVEKWCSKGRCRSLVELTPIAAVHGRWSSWGPRSPCSRSCGGGW 

TRRRQCNNPRPAFGGRACVGADLQAEMCNTQACEKTQLEFMSQQCARTDGQPLRSSPGG 

ASFYHWGAAVPHSQGDALCRHMCRAIGESFIMKRGDSFLDGTRCMPSGPREDGTLSLCVS 

GSCRTFGCDGRMDSQQVWDRCQVCGGDNSTCSPRKGSFTAGRAREYVTFLTVTPNLTSV 

YIANHRPLFTHLAVRIGGRYWAGKMSISPNTTYPSLLEDGRVEYRVALTEDRLPRLEEIRIWG 

PLQEDADIQVYRRYGEEYGNLTRPDITFTYFQPKPRQAWVWAAVRGPCSVSCGAGLRWVN 

YSCLDQARKELVETVQCQGSQQPPAWPEACVLEPC'PPYWAVGDFGPCSASCGGGLRERP 

VRCVEAQGSLLKTLPPARCRAGAQQPAVALETCNPQPCPARWEVSEPSSCTSAGGAGLAL 

ENETCVPGADGLEAPVTEGPGSVDEKLPAPEPCVGMSCPPGWGHLDATSAGEKAPSPWG 

SIRTGAQAAHVWTPAAGSCSVSCGRGLMELRFLCMDSALRVPVGEELCGLASKPGSRREV 

CQAVPCPARWQYKLAACSVSCGRGWRRILYCARAHGEDDGEEILLDTQCQGLPRPEPQE 

ACSLEPCPPRWKVMSLGPCSASCGLGTARRSVACVQLDQGQDVEVDEAACAALVRPEASV 

PCLIADCTYRWHVGTWMECSVSCGDGIQRRRDTCLGPQAQAPVPADFCQHLPKPVTVRGC 

WAGPCVGQGTPSLVPHEEAAAPGRTTATPAGACGRQHLEPTGTIDMRGPGQADCAVAIGR 

PLGEVVTLRVLESSLNCSAGDMLLLWGRLTWRKMCRKLLDMTFSSKTNTLWRQRCGRPG 

GGVLLRYGSQLAPETFYRECDMGLFGPWGEIVSPSLSPATSNAGGCRLFINVAPHARIAIHAL 

ATNMGAGTEGANASYILIRDTHSLRTTAFHGQQVLYWESESSQAEMEFSEGFLKAQASLRG 

QYWTLQSWVPEMQDPQSWKGKEGT 




Figure 3 

Domain structure of ADAMTS-M and translated nucleic acid sequence. A) Diagram of ADAMTS-M showing 
the following domains and signature motifs (with amino acid numbers in parentheses): partial prodomain ( 1 -97), 
furin cleavage site (94-97), metalloproteinase domain (98-3 1 1), zinc-binding motif (247-272), disintegrin domain 
(324-394), thrombospondui submotifs (410-473 and 1099-1 156), and a heparin-binding motif (4 1 9-424). B) 
ADAMTS-M nucleotide sequence with translated amino acid sequence above. ' 
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ADAMTS-M 

"l"416 



Thrombospondin 



+ 1PGRP TRP K A F SHSA PLL GLA LLRM HQR 

Partial Prodomain 



2 5 CCGGGTCGAC CCACGCGTCC GAAGGCCCCC TCTCACTCCG CTCCACTCCT CGGGCTGGCT CTCCTGAGGA TGCACCAGCG 
GGCCCAGCTG GGTGCGCAGG CTTCCGGGGG AGAGTGAGGC GAGGTGAGGA GCCCGACCGA GAGGACTCCT ACGTGGTCGC 

+ 1 HPR ARCP PLC V A G ILAC GFL LGC WGF 

Partial Prodomain 

105 TCACCCCCGG GCAAGATGCC CTCCCCTCTG TGTGGCCGGA ATCCTTGCCT GTGGCTTTCT CCTGGGCTGC TGGGGACCCT 
AGTGGGGGCC CGTTCTACGG GAGGGGAGAC ACACCGGCCT TAGGAACGGA CACCGAAAGA GGACCCGACG ACCCCTGGGA 

+ 1SHFQ QSC LQAL EPQ AVS SYLS PGA PLK 
Partial Prodomain 

185 CCCATTTCCA GCAGAGTTGT CTTCAGGCTT TGGAGCCACA GGCCGTGTCT TCTTACTTGA GCCCTGGTGC TCCCTTAAAA 

GGGTAAAGGT CGTCTCAACA GAAGTCCGAA ACCTCGGTGT CCGGCACAGA AGAATGAACT CGGGACCACG AGGGAATTTT 



+ 1GRPP SPG FQR QRQR QRR A A G GIL H LEL 

Furin Cleavage Site (Motif) 



Partial Prodomain 



Metalloproteinase Domain 



2 65 GGCCGCCCTC CTTCCCCTGG CTTCCAGAGG CAGAGGCAGA GGCAGAGGCG GGCTGCAGGC GGCATCCTAC ACCTGGAGCT 
CCGGCGGGAG GAAGGGGACC GAAGGTCTCC GTCTCCGTCT CCGTCTCCGC CCGACGTCCG CCGTAGGATG TGGACCTCGA 

+ 1 LVA VGPD VFQ AHQ EDTE RYV LTN LNI 

Metalloproteinase Domain 



34 5 GCTGGTGGCC GTGGGCCCCG ATGTCTTCCA GGCTCACCAG GAGGACACAG AGCGCTATGT GCTCACCAAC CTCAACATCG 
CGACCACCGG CACCCGGGGC TACAGAAGGT CCGAGTGGTC CTCCTGTGTC TCGCGATACA CGAGTGGTTG GAGTTGTAGC 



+ 1GAEL LRD P S L G . A Q F RVH LVKM VIL TEP 

Metalloproteinase Domain 

4 25 GGGCAGAACT GCTTCGGGAC CCGTCCCTGG GGGCTCAGTT TCGGGTGCAC CTGGTGAAGA TGGTCATTCT GACAGAGCCT 
CCCGTCTTGA CGAAGCCCTG GGCAGGGACC CCCGAGTCAA AGCCCACGTG GACCACTTCT ACCAGTAAGA CTGTCTCGGA 

+ 1EGAP NIT A N L TSSL L SV CGW SQTI NPE 

Metalloproteinase Domain 

50 5 GAGGGTGCTC CAAATATCAC AGCCAACCTC ACCTCGTCCC TGCTGAGCGT CTGTGGGTGG AGCCAGACCA TCAACCCTGA 
CTCCCACGAG GTTTATAGTG TCGGTTGGAG TGGAGCAGGG ACGACTCGCA GACACCCACC TCGGTCTGGT AGTTGGGACT 

+ 1 DDT DPGH ADL VLY ITRF DLE LPD GNR 

Metalloproteinase Domain 



58 5 GGACGACACG GATCCTGGCC ATGCTGACCT GGTCCTCTAT ATCACTAGGT TTGACCTGGA GTTGCCTGAT GGTAACCGGC 
CCTGCTGTGC CTAGGACCGG TACGACTGGA CCAGGAGATA TAGTGATCCA AACTGGACCT CAACGGACTA CCATTGGCCG 

+ 1QVRG VTQ LGGA CSP TWS CLIT EDT GFD 
Metalloproteinase Domain 



6 65 AGGTGCGGGG CGTCACCCAG CTGGGCGGTG CCTGCTCCCC AACCTGGAGC TGCCTCATTA CCGAGGACAC TGGCTTCGAC 
TCCACGCCCC GCAGTGGGTC GACCCGCCAC GGACGAGGGG TTGGACCTCG ACGGAGTAAT GGCTCCTGTG ACCGAAGCTG 

+ 1LGVT I A H E I G HSFG LEH DGA PGSG CGP 

Zinc-binding Motif 

Metalloproteinase Domain 



7 45 CTGGGAGTCA CCATTGCCCA TGAGATTGGG CACAGCTTCG GCCTGGAGCA CGACGGCGCG CCCGGCAGCG GCTGCGGCCC 
GACCCTCAGT GGTAACGGGT ACTCTAACCC GTGTCGAAGC CGGACCTCGT GCTGCCGCGC GGGCCGTCGC CGACGCCGGG 

+ 1 SGH VMAS DGA APR AGLA WSP CSR RQL 
Zinc-binding Motif 

Metalloproteinase Domain 

825 CAGCGGACAC GTGATGGCTT CGGACGGCGC CGCGCCCCGC GCCGGCCTCG CCTGGTCCCC CTGCAGCCGC CGGCAGCTGC 
GTCGCCTGTG CACTACCGAA GCCTGCCGCG GCGCGGGGCG CGGCCGGAGC GGACCAGGGG GACGTCGGCG GCCGTCGACG 

+ 1LSLL SAG RARC VWD PPR PQPG SAG HPP 
Metalloproteinase Domain 

905 TGAGCCTGCT CAGCGCAGGA CGGGCGCGCT GCGTGTGGGA CCCGCCGCGG CCTCAACCCG GGTCCGCGGG GCACCCGCCG 
ACTCGGACGA GTCGCGTCCT GCCCGCGCGA CGCACACCCT GGGCGGCGCC GGAGTTGGGC CCAGGCGCCC CGTGGGCGGC 

+ 1DAQP GLY YSA NEQC RVA FGP K A V A CTF 

Disintegrin Domain 

985 GATGCGCAGC CTGGCCTCTA CTACAGCGCC AACGAGCAGT GCCGCGTGGC CTTCGGCCCC AAGGCTGTCG CCTGCACCTT 
CTACGCGTCG GACCGGAGAT GATGTCGCGG TTGCTCGTCA CGGCGCACCG GAAGCCGGGG TTCCGACAGC GGACGTGGAA 



+ 1 ARE HLDM C Q A LSC HTDP LDQ SSC SRL 

Disintegrin Domain 

1065 CGCCAGGGAG CACCTGGATA TGTGCCAGGC CCTCTCCTGC CACACAGACC CGCTGGACCA AAGCAGCTGC AGCCGCCTCC 

GCGGTCCCTC GTGGACCTAT ACACGGTCCG GGAGAGGACG GTGTGTCTGG GCGACCTGGT TTCGTCGACG TCGGCGGAGG 

+ 1LVPL LDG TECG VEK WCS KGRC RSL VEL 
Disintegrin Domain 



114 5 TCGTTCCTCT CCTGGATGGG ACAGAATGTG GCGTGGAGAA GTGGTGCTCC AAGGGTCGCT GCCGCTCCCT GGTGGAGCTG 
AGCAAGGAGA GGACCTACCC TGTCTTACAC CGCACCTCTT CACCACGAGG TTCCCAGCGA CGGCGAGGGA CCACCTCGAC 

+ 1TPIA A V H GRW SSWG PRS P CS RSCG GGV 

Heparin-binding Motif 



Thrombospondin Submotif 

122 5 ACCCCCATAG CAGCAGTGCA TGGGCGCTGG TCTAGCTGGG GTCCCCGAAG TCCTTGCTCC CGCTCCTGCG GAGGAGGTGT 
TGGGGGTATC GTCGTCACGT ACCCGCGACC AGATCGACCC CAGGGGCTTC AGGAACGAGG GCGAGGACGC CTCCTCCACA 

+ 1 VTR RRQC NNP RPA FGGR ACV GAD LQA 

Thrombospondin Submotif 

1305 GGTCACCAGG AGGCGGCAGT GCAACAACCC CAGACCTGCC TTTGGGGGGC GTGCATGTGT TGGTGCTGAC CTCCAGGCCG 
CCAGTGGTCC TCCGCCGTCA CGTTGTTGGG GTCTGGACGG AAACCCCCCG CACGTACACA ACCACGACTG GAGGTCCGGC 

+ 1EMCN TQA CEKT QLE FMS QQCA RTD GQP 
Thrombospondin Submotif 

138 5 AGATGTGCAA CACTCAGGCC TGCGAGAAGA CCCAGCTGGA GTTCATGTCG CAACAGTGCG CCAGGACCGA CGGCCAGCCG 
TCTACACGTT GTGAGTCCGG ACGCTCTTCT GGGTCGACCT CAAGTACAGC GTTGTCACGC GGTCCTGGCT GCCGGTCGGC 

+ 1LRSS PGG ASF YHWG AAV PHS QGDA LCR 
14 65 CTGCGCTCCT CCCCTGGCGG CGCCTCCTTC TACCACTGGG GTGCTGCTGT ACCACACAGC CAAGGGGATG CTCTGTGCAG 
GACGCGAGGA GGGGACCGCC GCGGAGGAAG ATGGTGACCC CACGACGACA TGGTGTGTCG GTTCCCCTAC GAGACACGTC 

+ 1 HMC RAIG ESF IMK RGDS FLD GTR CMP 
154 5 ACACATGTGC CGGGCCATTG GCGAGAGCTT CATCATGAAG CGTGGAGACA GCTTCCTCGA TGGGACCCGG TGTATGCCAA 
TGTGTACACG GCCCGGTAAC CGCTCTCGAA GTAGTACTTC GCACCTCTGT CGAAGGAGCT ACCCTGGGCC ACATACGGTT 



' +1SGPR EDG TLSL CVS GSC RTFG CDG RMD 
162 5 GTGGCCCCCG GGAGGACGGG ACCCTGAGCC TGTGTGTGTC GGGCAGCTGC AGGACATTTG GCTGTGATGG TAGGATGGAC 
CACCGGGGGC CCTCCTGCCC TGGGACTCGG ACACACACAG CCCGTCGACG TCCTGTAAAC CGACACTACC ATCCTACCTG 



+ 1SQQV WDR CQV CGGD NST CSP RKG S FTA 
17 05 TCCCAGCAGG TATGGGACAG GTGCCAGGTG TGTGGTGGGG ACAACAGCAC GTGCAGCCCA CGGAAGGGCT CTTTCACAGC 
AGGGTCGTCC ATACCCTGTC CACGGTCCAC ACACCACCCC TGTTGTCGTG CACGTCGGGT GCCTTCCCGA GAAAGTGTCG 



+ 1 

1785 



GRA REYV TFL TVT PNLT SVY IAN HRP 
TGGCAGAGCG AGAGAATATG TCACGTTTCT GACAGTTACC CCCAACCTGA CCAGTGTCTA CATTGCCAAC CACAGGCCTC 
ACCGTCTCGC TCTCTTATAC AGTGCAAAGA CTGTCAATGG GGGTTGGACT GGTCACAGAT GTAACGGTTG GTGTCCGGAG 




+ 1LFTH LAV RIGG RYV V A. G KMSI SPN TTV 
18 65 TCTTCACACA CTTGGCGGTG AGGATCGGAG GGCGCTATGT CGTGGCTGGG AAGATGAGCA TCTCCCCTAA CACCACCTAC 
AGAAGTGTGT GAACCGCCAC TCCTAGCCTC CCGCGATACA GCACCGACCC TTCTACTCGT AGAGGGGATT GTGGTGGATG 

+ 1PSLL EDG RVE YRVA LTE DRL PRLE E I R 
194 5 CCCTCCCTCC TGGAGGATGG TCGTGTCGAG TACAGAGTGG CCCTCACCGA GGACCGGCTG CCCCGCCTGG AGGAGATCCG 
GGGAGGGAGG ACCTCCTACC AGCACAGCTC ATGTCTCACC GGGAGTGGCT CCTGGCCGAC GGGGCGGACC TCCTCTAGGC 

+ 1 IWG PLQE DAD I Q V YRRY GEE YGN LTR 
202 5 CATCTGGGGA CCCCTCCAGG AAGATGCTGA CATCCAGGTT TACAGGCGGT ATGGCGAGGA GTATGGCAAC CTCACCCGCC 
GTAGACCCCT GGGGAGGTCC TTCTACGACT GTAGGTCCAA ATGTCCGCCA TACCGCTCCT CATACCGTTG GAGTGGGCGG 

+ 1PDIT FTY FQPK PRQ AWV WAAV RGP CSV 
2105 CAGACATCAC CTTCACCTAC TTCCAGCCTA AGCCACGGCA GGCCTGGGTG TGGGCCGCTG TGCGTGGGCC CTGCTCGGTG 
GTCTGTAGTG GAAGTGGATG AAGGTCGGAT TCGGTGCCGT CCGGACCCAC ACCCGGCGAC ACGCACCCGG GACGAGCCAC 

+ 1SCGA GLR WVN YSCL DQA RKE LVET VQC 
2185 AGCTGTGGGG CAGGGCTGCG CTGGGTAAAC TACAGCTGCC TGGACCAGGC CAGGAAGGAG TTGGTGGAGA CTGTCCAGTG 
TCGACACCCC GTCCCGACGC GACCCATTTG ATGTCGACGG ACCTGGTCCG GTCCTTCCTC AACCACCTCT GACAGGTCAC 

+ 1 QGS QQPP AWP EAC VLEP CPP YWA VGD 
22 65 CCAAGGGAGC CAGCAGCCAC CAGCGTGGCC AGAGGCCTGC GTGCTCGAAC CCTGCCCTCC CTACTGGGCG GTGGGAGACT 
GGTTCCCTCG GTCGTCGGTG GTCGCACCGG TCTCCGGACG CACGAGCTTG GGACGGGAGG GATGACCCGC CACCCTCTGA 

+ 1 F G PC.SAS CGGG LRE RPV RCVE A Q G SLL 
2 34 5 TCGGCCCATG CAGCGCCTCC TGTGGGGGTG GCCTGCGGGA GCGGCCAGTG CGCTGCGTGG AGGCCCAGGG CAGCCTCCTG 
AGCCGGGTAC GTCGCGGAGG ACACCCCCAC CGGACGCCCT CGCCGGTCAC GCGACGCACC TCCGGGTCCC GTCGGAGGAC 

+ 1KTLP PAR CRA GAQQ PAV ALE TCNP QPC 
24 25 AAGACATTGC CCCCAGCCCG GTGCAGAGCA GGGGCCCAGC AGCCAGCTGT GGCGCTGGAA ACCTGCAACC CCCAGCCCTG 
TTCTGTAACG GGGGTCGGGC CACGTCTCGT CCCCGGGTCG TCGGTCGACA CCGCGACCTT TGGACGTTGG GGGTCGGGAC 

+ 1 PAR WEVS EPS SCT SAGG AGL ALE NET 
2 505 CCCTGCCAGG TGGGAGGTGT CAGAGCCCAG CTCATGCACA TCAGCTGGTG GAGCAGGCCT GGCCTTGGAG AACGAGACCT 
GGGACGGTCC ACCCTCCACA GTCTCGGGTC GAGTACGTGT AGTCGACCAC CTCGTCCGGA CCGGAACCTC TTGCTCTGGA 

+ 1CVPG ADG L EAP VTE GPG SVDE K LP APE 
2 58 5 GTGTGCCAGG GGCAGATGGC CTGGAGGCTC CAGTGACTGA GGGGCCTGGC TCCGTAGATG AGAAGCTGCC TGCCCCTGAG 
CACACGGTCC CCGTCTACCG GACCTCCGAG GTCACTGACT CCCCGGACCG AGGCATCTAC TCTTCGACGG ACGGGGACTC 

+ 1PCVG MSC PPG WGHL DAT SAG EKAP SPW 
2 665. CCCTGTGTCG GGATGTCATG TCCTCCAGGC TGGGGCCATC TGGATGCCAC CTCTGCAGGG GAGAAGGCTC CCTCCCCATG 
GGGACACAGC CCTACAGTAC AGGAGGTCCG ACCCCGGTAG ACCTACGGTG GAGACGTCCC CTCTTCCGAG GGAGGGGTAC 

+ 1 G. SI RTGA Q A A HVW TPAA GSC SVS CGR 
27 4 5 GGGCAGCATC AGGACGGGGG CTCAAGCTGC ACACGTGTGG ACCCCTGCGG CAGGGTCGTG CTCCGTCTCC TGCGGGCGAG 
CCCGTCGTAG TCCTGCCCCC GAGTTCGACG TGTGCACACC TGGGGACGCC GTCCCAGCAC GAGGCAGAGG ACGCCCGCTC 

+ 1GLME LRF LCMD SAL RVP VQEE LCG LAS 
2825 GTCTGATGGA GCTGCGTTTC CTGTGCATGG ACTCTGCCCT CAGGGTGCCT GTCCAGGAAG AGCTGTGTGG CCTGGCAAGC 
CAGACTACCT CGACGCAAAG GACACGTACC TGAGACGGGA GTCCCACGGA CAGGTCCTTC TCGACACACC GGACCGTTCG 



+ 1KPGS RRE VCQ A V P C PAR WQY K L A A CSV 
2 905 AAGCCTGGGA GCCGGCGGGA GGTCTGCCAG GCTGTCCCGT GCCCTGCTCG GTGGCAGTAC AAGCTGGCGG CCTGCAGCGT 
TTCGGACCCT CGGCCGCCCT CCAGACGGTC CGACAGGGCA CGGGACGAGC CACCGTCATG TTCGACCGCC GGACGTCGCA 

+ 1 SCG .RGVV RRI LYC ARAH GED DGE E I L 
2 98 5 GAGCTGTGGG AGAGGGGTCG TGCGGAGGAT CCTGTATTGT GCCCGGGCCC ATGGGGAGGA CGATGGTGAG GAGATCCTGT 
CTCGACACCC TCTCCCCAGC ACGCCTCCTA GGACATAACA CGGGCCCGGG TACCCCTCCT GCTACCACTC CTCTAGGACA 

+ 1 L . D T Q CQG LPRP EPQ EAC SLEP CPP RWK 
30 65 TGGACACCCA GTGCCAGGGG CTGCCTCGCC CGGAACCCCA GGAGGCCTGC AGCCTGGAGC CCTGCCCACC TAGGTGGAAA 
ACCTGTGGGT CACGGTCCCC GACGGAGCGG GCCTTGGGGT CCTCCGGACG TCGGACCTCG GGACGGGTGG ATCCACCTTT 

+ 1VMSL GPC SAS CGLG TAR RSV A C V Q LDQ 
314 5 GTCATGTCCC TTGGCCCATG TTCGGCCAGC TGTGGCCTTG GCACTGCTAG ACGCTCGGTG GCCTGTGTGC AGCTCGACCA 
CAGTACAGGG AACCGGGTAC AAGCCGGTCG ACACCGGAAC CGTGACGATC TGCGAGCCAC CGGACACACG TCGAGCTGGT 

+ 1 GQD VEVD E A A CAA LVRP EAS VPC L I A 
32 2 5 AGGCCAGGAC GTGGAGGTGG ACGAGGCGGC CTGTGCGGCG CTGGTGCGGC CCGAGGCCAG TGTCCCCTGT CTCATTGCCG 
TCCGGTCCTG CACCTCCACC TGCTCCGCCG GACACGCCGC GACCACGCCG GGCTCCGGTC ACAGGGGACA GAGTAACGGC 



+ 1DCTY RWH VGTW MEC SVS CGDG IQR RRD 

Thrombospondin Submotif 



33 05 ACTGCACCTA CCGCTGGCAT GTTGGCACCT GGATGGAGTG CTCTGTTTCC TGTGGGGATG GCATCCAGCG CCGGCGTGAC 
TGACGTGGAT GGCGACCGTA CAACCGTGGA CCTACCTCAC GAGACAAAGG ACACCCCTAC CGTAGGTCGC GGCCGCACTG 

+ 1TCLG PQA QAP VPAD FCQ HLP KPVT V R G 
Thrombospondin Submotif 



33 85 ACCTGCCTCG GACCCCAGGC CCAGGCGCCT GTGCCAGCTG ATTTCTGCCA GCACTTGCCC AAGCCGGTGA CTGTGCGTGG 
TGGACGGAGC CTGGGGTCCG GGTCCGCGGA CACGGTCGAC TAAAGACGGT CGTGAACGGG TTCGGCCACT GACACGCACC 

+ 1 CWA GPCV GQG TPS LVPH EEA A A P GRT 
Thrombospondin Submotif 



3 4 65 CTGCTGGGCT GGGCCCTGTG TGGGACAGGG TACGCCCAGC CTGGTGCCCC ACGAAGAAGC CGCTGCTCCA GGACGGACCA 
GACGACCCGA CCCGGGACAC ACCCTGTCCC ATGCGGGTCG GACCACGGGG TGCTTCTTCG GCGACGAGGT CCTGCCTGGT 

+ 1TATP A G A CGRQ HLE PTG TIDM RGP GQA 
354 5 CAGCCACCCC TGCTGGTGCC TGTGGCAGGC AGCACCTTGA GCCAACAGGA ACCATTGACA TGCGAGGCCC AGGGCAGGCA 
GTCGGTGGGG ACGACCACGG ACACCGTCCG TCGTGGAACT CGGTTGTCCT TGGTAACTGT ACGCTCCGGG TCCCGTCCGT 



+ 1DCAV A I G RPL GEVV TLR VLE SSLN CSA 
3 625 GACTGTGCAG TGGCCATTGG GCGGCCCCTC GGGGAGGTGG TGACCCTCCG CGTCCTTGAG AGTTCTCTCA ACTGCAGTGC 
CTGACACGTC ACCGGTAACC CGCCGGGGAG CCCCTCCACC ACTGGGAGGC GCAGGAACTC TCAAGAGAGT TGACGTCACG 



+ 1 GDM LLLW GRL TWR KMCR KLL DMT FSS 
37 05 GGGGGACATG TTGCTGCTTT GGGGCCGGCT CACCTGGAGG AAGATGTGCA GGAAGCTGTT GGACATGACT TTCAGCTCCA 
CCCCCTGTAC AAC G ACG AAA CCCCGGCCGA GTGGACCTCC TTCTACACGT CCTTCGACAA CCTGTACTGA AAGTCGAGGT 



+ 1KTNT LVV RQRC GRP GGG VLLR YGS QLA 
3 7 85 AGACCAACAC GCTGGTGGTG AGGCAGCGCT GCGGGCGGCC AGGAGGTGGG GTGCTGCTGC GGTATGGGAG CCAGCTTGCT 
TCTGGTTGTG CGACCACCAC TCCGTCGCGA CGCCCGCCGG TCCTCCACCC CACGACGACG CCATACCCTC GGTCGAACGA 



[o ■ 




+ 1PETF YRE CDM QLFG PWG EIV SPSL SPA 
38 65 CCTGAAACCT TCTACAGAGA ATGTGACATG CAGCTCTTTG GGCCCTGGGG TGAAATCGTG AGCCCCTCGC TGAGTCCAGC 
GGACTTTGGA AGATGTCTCT TACACTGTAC GTCGAGAAAC CCGGGACCCC ACTTTAGCAC TCGGGGAGCG ACTCAGGTCG 



+ 1 TSN AGGC RLF INV APHA R I A I H A LAT 
3 94 5 CACGAGTAAT GCAGGGGGCT GCCGGCTCTT CATTAATGTG GCTCCGCACG CACGGATTGC CATCCATGCC CTGGCCACCA 
GTGCTCATTA CGTCCCCCGA CGGCCGAGAA GTAATTACAC CGAGGCGTGC GTGCCTAACG GTAGGTACGG GACCGGTGGT 



+ 1NM GA GTE G A N A SYI LIR DTHS L R T 1 T A F 
4025 ACATGGGCGC TGGGACCGAG GGAGCCAATG CCAGCTACAT CTTGATCCGG GACACCCACA GCTTGAGGAC CACAGCGTTC 
TGTACCCGCG ACCCTGGCTC CCTCGGTTAC GGTCGATGTA GAACTAGGCC CTGTGGGTGT CGAACTCCTG GTGTCGC AAG 



+ 1HGQQ VLY WES ESSQ AEM EFS EGFL KAQ 
4105 CATGGGCAGC AGGTGCTCTA CTGGGAGTCA GAGAGCAGCC AGGCTGAGAT GGAGTTCAGC GAGGGCTTCC TGAAGGCTCA 
GTACCCGTCG TCCACGAGAT GACCCTCAGT CTCTCGTCGG TCCGACTCTA CCTCAAGTCG CTCCCGAAGG ACTTCCGAGT 

+ 1 ASL RGQY WTL QSW VPEM QD P QSW KGK 
4185 GGCCAGCCTG CGGGGCCAGT ACTGGACCCT CCAATCATGG GTACCGGAGA TGCAGGACCC TCAGTCCTGG AAGGGAAAGG 
CCGGTCGGAC GCCCCGGTCA TGACCTGGGA GGTTAGTACC CATGGCCTCT ACGTCCTGGG AGTCAGGACC TTCCCTTTCC 



+ 1 E G T 
4 2 65 AAGGAACC 
TTCCTTGG 
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